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PRODUCTION OF HUMAN RECOMBINANT COLLAGEN 
IN THE MILK OF TRANSG ENIC ANIMALS 

This is a continuation-in-part of United States 
Serial Number 08/011, 643. filed 28 January 1993, the 
5 contents of which are incorporated herein by reference. 

Technical Field 

The invention relates to production of recombinant 
proteins, specifically collagen, in the milk of a 
transgenic mammal. More specifically, it concerns 
10 methods to prepare purified forms of useful human 

collagen by effecting the secretion of the collagen (or 
procollagen) into the milk of a transgenic mammal. 

Background Art 

Collagen is a major structural protein useful in 

15 reconstructive therapeutic procedures in humans. 

Collagens used for these purposes are generally prepared 
by isolating the material from tissues of farm animals 
such as cows or pigs. While such isolated collagen has 
been used with some success, it is essentially a protein 

20 foreign to the treated human being and immunogenic 

responses can be a problem. This problem has been 
minimized by treating the animal -derived collagen with 
proteolytic enzymes to decrease immunogenicity . 

It is clear tHat it would be advantageous to supply 

25 human rather than bovine or porcine collagen for 

therapeutic purposes. The sources for purified human 
collagen are limited and the only reliable source is 
human placenta. Human collagen can be purified from 
human placenta as described in copending U.S. Patent 

30 Application Serial No. 07/921,810 (Collagen Corporation), 

The placenta contains several types of collagens, most 
notably types I, III, IV, and V. The process of 
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separating and purifying one type from the others is 
imperfect and results in a predominant type with small 
amounts of the other types. Production of purified 
collagen from placentas further necessitates additional 
5 processing steps to ensure that the resulting collagen 

product is free from human viruses such as hepatitis and 
HIV. In view of this, there have been attempts to 
prepare human collagen using recombinant techniques. 

Expression of the human cartilage procollagen gene 

10 (Col2Al) in mouse 3T3 cells been reported (Ala-Kokko, L. 

et al., J Biol Chem (1991) 266:14175-14178). Olsen, A.S. 
et al . reported expression of a minigene version of the 
human proal(I) collagen gene in mouse fibroblasts (Olsen, 
A.S. et al., J Biol Chem (1991) 266:1117-1121). Full- 

15 length human proa2 (V) collagen cDNA in proa2 (V) -deficient 

hamster cells was reported by Greenspan, D.S. et al . , J 
Biol Chem (1989) 264 : 20683-20687 ; mouse fibroblasts have 
also been used to express the proal(I) chain wherein the 
resulting expressed protein is complexed in the collagen 

20 triple helix with murine proa2(I) chains, as described by 

Schnieke et al . , Proc Natl Acad Sci USA (1987) 84:8869- 
8873. Transgenic mice that were modified to contain a 
mutated form of the proal(I) gene were not viable after 
birth, according to a study by Stacey, A. et al . Nature 

25 (1988) 322 : 131-136 . In addition, transgenic mice have 

been obtained that express a minigene version of the 
human gene for type I procollagen systemically (Khillan, 
J.S. et al., J Biol Chem (1991) 2£6:23373-23379) ; PCT 
application W092/22333. These mice are useful as model 

30 systems for investigating bone diseases characterized by 

the modified collagen produced. 

The production of recombinant human collagen is made < 
troublesome by the necessity for a multiplicity of 
posttranslational enzymes which are generally believed to 

35 be present only in cells which natively produce collagen. 
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At least eight such posttranslational enzymes are 
believed to be needed (Prockop et al . , New England J Med 
(1984) 111:376-386). This has limited attempts at 
recombinant production to cells which natively produce 
5 this protein; this inevitably results in chimeric forms 

of the protein. 

In order to avoid chimeric collagens which contain 
partly human and partly host mammalian cell chains in the 
triple helix, it might be possible to use human cells for 

10 this production. Even in this case, however, it is not 

possible to obtain collagen product of a particular type 
free of other collagen types. As further described 
below, the variety of collagens produced and their innate 
similarity makes homogeneous preparations from either 

15 native or recombinant sources which produce their own 

collagen impossible . 

The present invention solves these problems by 
effecting the synthesis of human procollagen or collagen 
in cells which do not natively produce this protein, 

20 employing techniques established for the production of 

foreign proteins in mammalian milk, as described in the 
publications cited hereinbelow. The collagen of 
designated types is secreted into the milk either as 
procollagen or collagen, depending on the construction of 

25 the expression systems and accompanying recombinant 

enzyme production. 

Disclosure of the Invention 

The invention provides recombinant production of 
human collagen in a form that permits isolation of a 
30 homogeneous collagen type and can be designed to effect 

the production of commercially practical amounts of these 
proteins at a reasonable cost. The invention utilizes 
systems developed for the production of recombinant 
proteins in mammalian milk and requires utilization of 
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these techniques not only to effect the expression of the 
gene encoding the desired collagen, but also, if 
required, expression of the gene for any required 
posttranslational enzymes. 
5 Thus, in one aspect, the invention is directed to a 

method for the recombinant production of human 
procollagen or collagen comprising recovering milk from 
the mammary glands of a nonhuman mammal. The mammal will 
have been modified to contain an expression system that 

10 comprises DNA encoding human procollagen under the 

control of regulatory sequences operable in mammary 
glands. The human procollagen or collagen produced is 
recovered from the milk by various purification 
techniques. The nonhuman mammal may also be modified if 

15 necessary to contain an expression system for the 

production of any needed posttranslational enzymes in the 
milk protein-secreting cells of the mammary glands. 

Either collagen or procollagen may be secreted 
depending on the presence or absence of suitable 

20 proteases in the cell. The procollagen encoded in the 

nucleotide sequence contained in the expression system 
will be preceded by a nucleotide sequence encoding an 
appropriate signal, either that natively associated with 
the procollagen or an alternate signal sequence workable 

25 in the targeted cells. Thus, the procollagen produced as 

a result of a recombinant expression will be secreted 
into the milk. If the host cells contain enzymes which 
ordinarily effect cleavage of the prosequences from 
collagen -- i.e. procollagen N-protease and/or 

3 0 procollagen C-protease, the procollagen will be cleaved 

of the prosequences as it exits the cell and collagen 
will be secreted into the medium. However, if these 
enzymes are absent from the production cell, procollagen 
itself will be secreted. Low levels of these proteases 

35 will result in mixtures of collagen and procollagen. 
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Apparently the levels of these enzymes vary in cells 
which natively produce collagen. Depending on the tissue 
and the developmental stage of the subject from which the 
tissue originates, a greater or lesser proportion of 
5 procollagen or collagen will be contained in the secreted 

materials. Thus, the milk which contains the collagen of 
the invention will contain this collagen in the form of 
human collagen per se, human procollagen per se or a 
mixture of both. 

10 While it would be possible to modify the native 

procollagen genes to delete the coding sequences for the 
prosequences, it is not desirable to do this since the 
pro-region, especially the C-terminal pro-region, 
mediates the formation of triple helixes by the collagen 

15 portion of the molecule. Thus, if the prosequences are 

deleted from the expression vector, the resulting single 
collagen chains would be unable to form the triple helix 
which characterizes the collagen molecules. 

If procollagen is secreted into the milk, of course, 

20 by supplying the appropriate proteolytic enzymes, 

collagen will result. 

In another aspect, the invention is directed to 
expression systems useful in the foregoing method which 
comprise a DNA sequence encoding human procollagen 

25 operably linked to a promoter and other regulatory 

sequences capable of effecting expression in mammary 
glands. If necessary, expression systems operable in 
mammary glands for production of posttranslational 
enzymes can also be used. The invention also is directed 

30 to nonhuman embryonic stem (ES) cells and to nonhuman 

eggs, including fertilized forms, modified to contain the 
expression system as well as to the nonhuman mammal 
implanted with the fertilized egg or with a blastula 
including the ES cells. 
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In other aspects, the invention is directed to milk 
containing human procollagen or collagen, and to 
homogeneous forms of human procollagen or collagen. 
These forms are made available by the practice of the 
5 invention method which permits the production of only the 

recombinant collagen type desired absent a background of 
either similar nonhuman collagen molecules, or of 
collagens of different types. 

Modes of Carrying Out the Invention 

10 Collagen is a well studied protein, and the 

expression of genes encoding collagen has also been 
reviewed recently (Adams, S.L., Amer J Respir Cell and 
Molec Biol (1989) 1:161-168). This review summarizes the 
types of collagen known to occur and describes their 

15 common features. The mRNAs encoding collagens of various 

types are translated in the cytoplasm of collagen- 
producing cells into procollagen subunits which are then 
assembled into triple helices. The assembled procollagen 
contains propeptide extensions at the N and C termini 

20 that help to assemble the subunits, but do not 

participate in the triple helix. The prosequences are 
then cleaved to obtain collagen triple helix as the 
procollagen is secreted. The collagen helix itself 
contains nonhelical extensions designated telopeptides . 

25 The triple helical regions contain repeating amino acid 

sequences with a glycine in every third position and 
proline (P) or hydroxyproline (HP) often in the other 
positions so as to contain a sequence of "triplets" of 
the form -(GXY) n -, wherein X or Y or both are P or HP. 

3 0 One of the essential posttranslational steps is the 

conversion of some proline residues to hydroxyproline to 
ensure stability of the triple helix at body temperature. 
Other important posttranslational modifications are 
disulfide exchange, hydroxylation of lysyl residues, 
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addition of carbohydrate and the assembly and 
crosslinking of the triple helical collagen molecules. 

According to the Adams review, thirteen genetically 
distinct collagen types have been described and represent 
5 the products of at least 23 genes. The most common types 

found in interstitial tissues are types I, III, V and VI; 
in cartilage, types IT, IX, X and XI are found. Some of 
these types exist natively as homotriplexes ; others are 
heterotriplexes . 

10 The nomenclature for the various collagen types is 

designed to designate the genetic origin of the collagen 
in question. For example, the triple helix of type I 
collagen is a heterotriplex containing the products of 
two different collagen-encoding genes. This type of 

15 collagen is designated [a,(I)] 2 a 2 (I); thus, type I 

collagen triplexes contain two chains encoded by the 
CollAl gene and one protein chain encoded by the CollA2 
gene. Type III collagen is designated [a, (III) 3 3 and is 
thus comprised of three identical chains translated from 

20 the Col3Al gene. Type II collagen is also a homopolymer 

designated [a,(II)] 3 which is comprised of translation 
products of the Col2Al gene. 

Since collagen-producing cells, as described above, 
produce several types of collagen, it has, in the past, 

25 been impossible to obtain, for example, homogeneous type 

I collagen free of type III collagen. By producing 
collagen in noncollagen-producing cells according to the 
method of the invention, obtaining such homogeneous 
preparations becomes possible. 

30 The genetic materials for use in the method of the 

invention encoding the desired collagens are available. 
The genes encoding human types I, II, III, IV and V 
collagen are currently available. 

Prockop, DJ et al . ( supra ) list the following 

3 5 cotranslational and posttranslational modifications that 
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occur when collagen is produced in fibroblasts: cleavage 
of signal peptides at the N- termini of the chains, 
hydroxylation of the Y-position proline and lysine 
residues, hydroxylation of a few X-position proline 
5 residues, addition of galactose or galactose and then 
glucose to some of the hydroxylysines, addition of a 
mannose-rich oligosaccharide to the C propeptides, 
association of the C-terminal propeptides through a 
process directed by a structure of these domains, and 

10 finally formation of both intrachain and interchain 

disulfide bonds in the propeptides. After secretion of 
the procollagen, the N propeptides are cleaved by a 
procollagen N proteinase and the C propeptides by a 
separate procollagen C proteinase. The collagen then 

15 self -assembles into fibrils, and lysyl oxidase converts 

some lysine and hydroxylysine residues to the aldehyde 
derivatives to form cross- links with similar residues in 
adjacent molecules. 

It is not entirely clear whether mammary cells, 

20 since they do not endogenously produce collagen, contain 

the enzymes necessary for these posttranslational events . 
Since the assembly into triplexes is mediated by the 
sequences of the C-terminal extensions, in the event the 
epithelial cells of the mammary glands lack the required 

25 proteases, it is believed that the assembly into 

triplexes can be effected extracellularly by providing 
appropriate secretion signals to the procollagen molecule 
as stated above and adding suitable proteases . 
Alternatively, the proteases could be produced 

30 recombinantly in the epithelial mammary cells. The 

enzymes most likely to be needed by the mammary cells in 
order to effect required posttranslational processing are 
protein disulfide isomerase and the a-subunit of prolyl 
hydroxylase. If these enzymes are not endogenously 

35 produced and must be provided recombinantly, expression 
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systems for their production may be supplied along with 
the expression systems for the collagen or procollagen 
itself. The gene for the a-subunit of prolyl hydroxylase 
has not yet been completely described but the gene 
5 encoding the protein disulfide isomerase has been 

partially sequenced as described by Tasanen, K. , et al, J 
Biol Chem (1988) 263:16218-16224 and J Biol Chem (1992) 
267 : 11513-11519. Genes encoding both proteins can be 
obtained using standard techniques. These two enzymes 

10 function together as a tetrameric protein comprising two 

subunits of prolyl hydroxylase noncovalently associated 
with two a subunits of protein disulfide isomerase. 
Although the two subunits of protein disulfide isomerase 
are functional as a dimer, the two a subunits of prolyl 

15 hydroxylase must be associated with protein disulfide 

isomerase in order to be active (Vuari, et al., 1992). 

A well developed system for use in the invention 
method utilizes milk production in cows. This system is 
summarized by Krimpenfort, P. et al. in Biotechnology 

20 (1991) 2:844-847. This article describes microinjection 

of fertilized bovine oocytes with genes encoding human 
proteins and development of the resulting embryos in 
surrogate mothers. The human genes were fused to the 
bovine otS x casein regulatory elements. This general 

25 technology was also described in PCT Application 

WO91/08216 published 13 June 1991 and assigned to 
GenPharm . 

Additional descriptions of the production of 
recombinant proteins by developing transgenic animals 

3 0 which secrete the proteins into milk are found in 

European Application 264166 published 20 April 1988, 
assigned to Integrated Genetics. This disclosure 
emphasizes use of whey acid protein control systems to 
effect protein secretion and cites use of this system for 

35 the production of tPA and Hepatitis B surface antigen in 
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goat milk. Analogous systems for production of foreign 
proteins are described in PCT application WO88/00239 
published 14 January 1988 and assigned to Pharmaceutical 
Proteins Limited. This application describes procedures 
5 for obtaining suitable regulatory DNA sequences for the 

products of the mammary glands of sheep, including beta 
lactoglobulin, and describes the construction of 
transgenic sheep modified so as to secrete foreign 
proteins in milk. An additional application, PCT 

10 WO88/01648, published 10 March 1988 and assigned to 

Immunex Corporation, generally describes construction of 
transgenic animals which secrete foreign proteins into 
milk under control of the regulatory sequences of bovine 
alpha lactalbumin gene. Finally, PCT application 

15 WO88/10118, published 29 December 1988 and assigned to 

Biogen, describes construction of transgenic mice and 
larger mammals for the production of various recombinant 
human proteins in milk. Other publications which 
describe the production of various proteins in milk 

20 include Archibold, A.L. et al . Proc Natl Acad Sci USA 

(1990) .87:5178-5182 which describes production of human 
a-antitrypsin in the milk of transgenic mice. This 
production utilized a hybrid gene constructed from the 
j8-lactoglobulin gene fused to an al -antitrypsin minigene. 

25 Pittius, C.W. et al. Proc Natl Acad Sci USA (1988) 

85.: 5874-5878 describe production of tissue plasminogen 
activator in the mammary glands of transgenic mice using 
the murine whey acidic protein promoter. Hennighausen, 
L, Protein Expression and Purification (1990) 1:3-8 

30 provides a review of the use of the mammary gland as a 

bioreactor and the production of various foreign proteins 
in milk. This article describes the factors that affect 
the level of production and indicates recommended forms 
of expression system construction. The disclosures of 
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the foregoing publications are incorporated herein by 
reference . 

Thus, techniques for construction of appropriate 
host vectors containing regulatory sequences effective to 
5 produce foreign proteins in mammary glands and cause the 

secretion of said protein into milk are known in the art. 
In addition, techniques for constructing transgenic 
mammals containing these systems, including mice as well 
as larger mammalian species such as cows, sheep and 
10 goats, are well known. 

Systems for the expression of the procollagen gene 
in cells that produce milk protein can be constructed 
using methodology analogous to that recently described 
for the production of human collagenase in the lungs of 
15 transgenic mice (D'Armiento et al . , Cell (1992) 71 ; 955- 

961) . 

Genes encoding a number of procollagen types have 
been obtained; and genes for additional types can be 
obtained similarly. The preparation and cloning of the 

20 human CollAl gene has been described (Barsh et al . , J 

Biol Chem (1984) 259:14906-14 913). Briefly, a human 
genome cosmid library is packaged and used to transduce 
E. coli, which are plated, grown, and screened using a 
nucleic acid sequence specific for the CollAl gene. 

25 Positive colonies are located, matured in broth, and the 

DNA isolated. Restriction endonucleases are used to cut 
the DNA at selected^ sites . The digested DNA is examined 
by gel electrophoresis and DNA sequencing. A cosmid 
clone CG 103 isolated from a human genomic library was 

30 shown to contain the entire human CollAl gene. 

Fragments of collagen genes have been selected from 
cosmid libraries (Barsh et al . , supra ) and from 
bacteriophage libraries (Chu et al . , J Biol Chem (1985) 
260:4357-4363 for type III collagen; Chu et al . , Nature 

35 (1984) 310:337-340 for type I collagen). The CollAl gene 
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was obtained in three overlapping genomic clones using 
the Charon 4 A bacteriophage vector. The CollA2 gene has 
also been obtained from five overlapping clones in Charon 
4A libraries (deWet et al., J Biol Chem (1987) 
5 262:16032-16036). It has been shown that the first 

intron is important in regulating the al(l) gene 
expression in a tissue-specific manner in transgenic mice 
(Slack, J.L. et al. Mol Cell Biol (1991) 11:2066-2074). 
As an alternative to using the entire gene, full- 

10 length cDNAs could be used, although the use of the 

entire gene has been shown to be more effective in 
transgenic animal experiments (Palmiter et al., Proc Natl 
Acad 5c i USA (1991) 88:478-482) . Such a full-length cDNA 
can be isolated from cDNA libraries, as was done for the 

15 cDNA for the alpha-2 chain of type I collagen (Lee et 

al., J Biol Chem (1988) 263:13414-13418), which was 
isolated from a lambda phage library. 

To construct an expression system compatible with 
the epithelial cells of mammary glands, the CollAl or 

20 other procollagen gene, as a DNA fragment is ligated to a 

similarly prepared DNA fragment containing the promoter 
and any additional required regulatory sequences for a 
milk- specif ic protein expression. As described by 
D'Armiento et al . , when ligating a promoter to a gene, it 

25 is necessary to preserve the translational start site for 

the protein. This may be accomplished by introducing a 
specific restriction endonuclease site immediately 
preceding the translation start site that is also unique 
for the 5' end of the chosen promoter. When these 

3 0 fragments are prepared using such a restriction 

endonuclease, the sites at the 3' end of the promoter 
will be compatible with the 5' end of the CollAl gene. 
When ligation occurs, the promoter will be ligated at the 
correct site of the gene to encode a messenger RNA that 

35 will allow translation from the translation start site of 
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the procollagen gene, analogous to ligation of the 
heptoglobin promoter to the human collagenase gene 
described by D'Armiento et al . , supra . The promoter-gene 
construct is ligated into a bacteriophage vector cloning 
5 system by treating the phage DNA with a restriction 

endonuclease; both ends of the foreign DNA are then 
ligated to the vector construct for cloning the DNA. 

CDNA containing the translation start site for 
expressed messenger RNA can also be ligated to a promoter 

10 to prepare a functional construct for introduction into a 

transgenic animal. This method was used for the human 
lactoferrin cDNA fused to the bovine alpha Sl-casein gene 
5' and 3' untranslated regions (Krimpenf ort ) . 

It is also understood that upstream regions of the 

15 promoter may be involved in regulating gene expression. 

Specifically, it has been shown that the extracellular 
matrix and hormones regulate the expression of bovine 
/J-casein by their influence on the upstream sequences in 
the relevant gene (Schmidhauser, C. et al. Proc Natl Acad 

20 Sci USA (1990) 87:9118-9122). In addition, signals for 

termination of transcription and translation are also 
helpful in elevating levels of expression. 

In order to reduce the size of the procollagen gene 
so that the construct could be cloned in bacteriophage, 

25 the gene itself could be shortened by reducing the size 

of the introns. This could be done for procollagen genes 
that are cloned as overlapping fragments. The introns at 
the junction sites of the fragments could be identified 
and treated with specific endonuclease to shorten the 

3 0 introns, but leave restriction sites that are compatible 

for ligation. Restriction sites could be altered by 
site-directed mutagenesis (D'Armiento et al., supra ) to 
generate restriction sites for ligation of the fragments 
of the procollagen gene into a single construct. Another 

35 method of accomplishing the removal of introns is to 
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prepare fusion genes containing cDNAs to replace two or 
more exons within the gene. 

One of the posttranslational modifying enzymes 
necessary for the production of collagen is protein 
5 disulfide isomerase, which, when combined with the alpha 

subunit of prolyl hydroxylase, forms a tetrameric protein 
isolated as prolyl hydroxylase. The gene for protein 
disulfide isomerase has been obtained from a human 
genomic library produced in a cosmid vector pcos 2EMBL 

10 (Poustka et al., Proc Natl Acad Sci USA (1984) £1:4129- 

4133) . The library was screened with cDNA fragments 
specific for human protein disulfide isomerase and 
several clones were obtained, at least two of which 
contained the entire gene (Tasanen et al . , J Biol Chem 

15 (1988) 263.:16218-16224) . 

For use in the expression systems of the invention, 
this gene can be cut from the cosmid DNA with restriction 
endonucleases and ligated to a milk-specific protein 
promoter using a strategy similar to that used for the 

20 construct of the heptoglobin-collagenase DNA. 

In the event that the mammary cells are unable to 
provide suitable enzymes for posttranslational 
modification of the procollagen produced, the transgenic 
animals would need to be modified with expression systems 

25 for these enzymes. Construction of these expression 

systems is analogous to that described herein for 
procollagen gene expression. The expression systems for 
the posttranslational enzymes are provided to the 
transgenic animal along with the expression systems for 

30 the desired collagen product. 

The choice of a promoter for expression in milk 
would preferably be from one of the milk-specific 
proteins, such as alpha Sl-casein 5' and 3' regulatory 
sequences, which were fused to the human lactoferrin 

35 cDNA, providing a construct that used the alpha Sl-casein 
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promoter and signal sequence for the human lactoferrin 
gene. Another construct used to express a foreign 
protein in sheep milk consisted of the sheep beta- 
lactoglobin promoter fused to human and antitrypsin gene 
5 fragments (Wright et al . , Biotechnology (1991) 9:830- 

833) . A third promoter that has been used is the whey- 
acid promoter, which was fused to cDNA for a modified 
version of human tissue plasminogen activator (Ebert et 
al., Biotechnology (1991) 9:835-838) and used to prepare 

10 transgenic goats in whose milk human tissue plasminogen 

activator was expressed. The sequence of the gene is 
scanned for available unique restriction endonuclease 
sites, which are selected so that the functional gene 
containing the precise translation start site is 

15 preserved in the mRNA. 

In the event that it is. desirable to provide 
posttranslational enzymes in the mammary cells, it is 
believed that the most important candidates are prolyl 
hydroxylase and protein disulfide isomerase. The gene 

2 0 for the chick alpha subunit of prolyl hydroxylase has not 

yet been completely isolated, but is known to be as large 
as 50kb (R.A. Berg unpublished information) . It is 
expected that the entire gene may be obtained from a 
human genomic cosmid library, as was done for the CollAl 
25 gene and the gene for protein disulfide isomerase. The 

cDNA for chick alpha subunit (Bassuk et al . , Proc Natl 
Acad Sci USA (1989) 86:7382-7386) and human alpha subunit 
(Helaakoski, T., Proc Natl Acad Sci USA (1989) 86:4392- 
4396) have been described. Since the gene is not yet 

3 0 available, the cDNA for the human alpha subunit for 

prolyl hydroxylase can be fused to the promoter for a 
milk-specific protein to produce a DNA construct for 
introduction into a transgenic animal. 

Using these systems, animals are obtained which 
35 secrete human collagen or procollagen into milk. The 
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gene encoding the desired procollagen chain is coupled to 
suitable control sequences which function in the mammary 
cells of mammalian species such as the regulatory 
sequences associated with the aSl casein gene, 
5 jS-lactalbumin or a-lactalbumin genes, 0-lactoglobin or 
lactoferrin genes. Both 5' and 3' regulatory sequences 
can be used. The genes encoding the required 
posttranslational enzymes are similarly constructed into 
expression systems using mammary cell -specif ic regulatory 

10 sequences. 

The resulting expression systems are microinjected 
using, for example, the technique described in U.S. 
Patent 4,873,191. The expression system constructs are 
amplified by PCR or cloning and purified by agarose gel 

15 electrophoresis. After electroelution, the concentration 

is adjusted to 1-10 /xg/ml and microinjected into the 
oocytes which are obtained from ovaries freshly removed 
from cows or other animals. The oocytes are aspirated 
from the follicles and allowed to settle before 

20 fertilization with thawed frozen sperm capacitated with 

heparin and pref ractionated by Percoll gradient to 
isolate the motile fraction. 

The fertilized oocytes are centrifuged, for example, 
for eight minutes at 15,000 x g to visualize the 

25 pronuclei for injection and then cultured from the zygote 

to morula or blastocyst stage in oviduct tissue - 
conditioned medium. This medium is prepared by using 
luminal tissues scraped from oviducts and diluted in 
culture medium. The zygotes must be placed in the 

30 culture medium within two hours following microinjection. 

Estrous is then synchronized in the intended 
recipient mammals such as cattle by administering * 
coprostanol . Estrous is produced within two days and the 
embryos are transferred to the recipients 5-7 days after 

35 estrous. 
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Successful transfer can be evaluated in the 
offspring by Southern blot. By utilizing this system to 
effect the expression of the CollAl gene, for example, 
the offspring can be evaluated for the presence of the 
5 CollAl gene by Southern hybridization using a CollAl gene 

derived probe. 

Alternatively, the desired constructs can be 
introduced into embryonic stem cells (ES cells) and the 
cells cultured to ensure modification by the transgene. 

10 The modified cells are then injected into the blastula 

embryonic stage and the blastulas replaced into 
pseudopregnant hosts. The resulting offspring are 
chimeric with respect to the ES and host cells, and 
nonchimeric strains which exclusively comprise the ES 

15 progeny can be obtained using conventional cross- 

breeding. This technique is described, for example, in 
PCT Application WO91/10741, published 25 July 1991. 

For production of the desired procollagen or 
collagen in milk, expression systems for both the 

20 procollagen gene and the posttranslational enzyme- 

encoding genes must be present in the transgenic animal . 
There are several ways to achieve this . 

First, the mammalian host may already produce the 
required levels of posttranslational enzymes in the 

25 epithelial cells of the mammary glands. Alternatively, 

the constructs to be microinjected into eggs or 
transfected into ES cells may include a cocktail of the 
desired procollagen gene expression system along with the 
expression systems similarly constructed for, for 

3 0 example, the prolyl hydroxylase and protein disulfide 

isomerase. The successful production of collagen in the 
milk can then be determined using antiprocollagen 
antibodies or by analysis of the milk for levels of 
hydroxyproline, a unique amino acid found in collagen as 

35 a result of the activity of prolyl hydroxylase. 
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In another alternative, the expression systems for 
the procollagen gene and the expression systems for any 
needed posttranslational enzyme -encoding genes may be 
injected into different batches of fertilized eggs or 
5 transfected into different batches of ES cells and used 
separately as described above to develop transgenic 
animals capable of expressing the procollagen or collagen 
genes and the posttranslational enzyme -encoding genes, 
respectively. These transgenic animals can then be 

10 crossbred and the offspring evaluated for the ability to 

express both such systems. At least some of the 
offspring of such transgenic animals will be capable of 
producing both the collagen product and the 
posttranslational enzyme product. 

15 In still another approach, fertilized eggs or ES 

cells may be prepared from transgenic animals already 
modified to have the capacity to express one or the other 
of the procollagen genes or the posttranslational enzyme - 
encoding genes. These eggs can then be microinjected or 

20 the ES cells transfected with the expression system for 

the proteins lacking in the transgenic animal to develop 
into a transgenic animal containing expression systems 
for all of the required components. 

Similarly, transgenic animals already modified with 

25 respect to one desired gene may be used as sources for 

the blastulas into which modified ES cells are implanted. 
Again, chimeric animals will result which can be used in 
cross-breeding to obtain offspring having genes for all 
of the desired proteins. 

30 It may be noted that the expression systems for both 

of the particular posttranslational enzymes described 
above, if needed, must be provided essentially 
simultaneously since the enzymes function together as a 
tetrameric protein; as described above, the two a 
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subunits of prolyl hydroxylase must be associated with 
protein disulfide isomerase in order to be active. 

When suitable transgenic mammals have been obtained 
by any of the foregoing methods, the procollagen or 
5 collagen is secreted into the milk. The procollagen or 

collagen product of the transgenic mammal will be 
determined by the nature of the procollagen gene in the 
expression system provided. For homotriplexes , only a 
single gene is inserted. For production of 

10 heterotriplexes, such as typical human collagen type I, 

either both the CollAl and CollA2 genes are utilized in 
the original microinjection, or mammals transgenic for 
human CollAl are crossbred with mammals transgenic for 
human CollA2 . The type III collagen gene Col3Al can be 

15 used to prepare a transgenic animal and may be simpler 

because only one collagen polypeptide chain is required. 

For the procollagen genes provided in the expression 
systems, procollagen is secreted into the milk if the 
required proteases for conversion to collagen are absent. 

20 To the extent that these protease enzymes are absent from 

the secreting epithelial cells and are not provided for 
by recombinant systems, procollagen is secreted into the 
milk and can be recovered in a manner analogous to 
procedures that would be used for collagen per se. The 

25 procollagen can also be converted before or after 

purification using specific proteases to cleave the 
prosequences as is known in the art. On the other hand, 
if the proteases are natively present intracellularly or 
are provided by recombinant systems, collagen will be 

30 secreted directly. Depending on the levels of these 

enzymes, mixtures of procollagen and collagen may be 
obtained in the milk which can, if desired, be converted 
by treatment of the milk with proteases to convert all of 
the relevant molecules to collagen per se. 
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As described above, previous preparations of human 
collagen of a given type are always contaminated by the 
presence of alternative type collagens in view of the 
similarity of these materials and in view of the capacity 
5 of native or other recombinant cells previously used to 

produce collagens encoded by their own genomes. By use 
of the method of the "invention, it is possible to obtain 
collagen or procollagen of a given type free from 
coexpressed collagens or procollagens of alternative 
10 types. 

Purification of collagen or procollagen from milk is 
accomplished using their characteristic solubility and 
chemical properties. For example, milk may be acidified, 
causing milk- specif ic proteins such as casein to 

15 precipitate and collagen or procollagen to remain in 

solution. The collagen or procollagen may be 
precipitated from acid solutions by the addition of salt, 
alcohol, or propylene glycol. (Miller, E.J. and Rhodes, 
R.K., Methods in Enzvmoloav (1982) 82:33-64); Sage, H. 

20 and Bernstein, P., ibid . , 96-127.) 
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Claims 

1. A method to prepare human collagen or human 
procollagen which method comprises: 

recovering milk from the mammary glands of a 
5 nonhuman mammal which mammal has been modified to contain 
an expression system which comprises a coding nucleotide 
sequence encoding at least one human procollagen operably 
linked to control nucleotide sequences that effect 
expression specifically in milk protein-secreting 
10 epithelial cells of said mammary glands under conditions 
wherein said coding nucleotide sequence is expressed to 
secrete human procollagen or human collagen into the milk 
of said mammal ; and 

recovering the human procollagen or human collagen 
15 from the milk. 

2. An expression system for production of human 
procollagen or human collagen in milk which expression 
system comprises a coding nucleotide sequence encoding 
human procollagen operably linked to a promoter capable 

20 of specifically effecting expression in milk protein- 
secreting cells of mammary glands. 

3 . The expression system of claim 2 wherein the 
human procollagen is the pro-al chain of type I collagen 
or is the pro-al chain of type III collagen. 

25 4. A fertilized nonhuman egg containing the 

expression system of claim 2. 

5. A, nonhuman embryonic stem cell modified to 
contain the expression system of claim 2. 
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6 . A transgenic nonhuman mammal which comprises 
the expression system of claim 2. 

7. The nonhuman mammal of claim 6 which has been 
further modified to contain at least one expression 

5 system which effects the production of posttranslational 

modification enzymes for procollagen. 

8. Milk containing human collagen or human 
procollagen. 

9. A human procollagen or human collagen 

10 composition containing only procollagen or collagen 

molecules of a single collagen type. 
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